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(54) Load adaptive buffer management in packet networks 



(57) The setting of the queue thresholds in active 
queue management schemes such as RED (random 
early detection) is problematic because the required 
buffer size for good sharing among TCP connections is 
dependent on the number of TCP connection using the 
buffer. Techniques for enhancing the effectiveness of 
such buffer management schemes are described. The 
techniques dynamically change the threshold settings 
as the system load, e.g., the number of connections, 



changes. The invention uses variables that correlate 
well with system load. The variables should reflect the 
congestion notification rate since this rate is closely re- 
lated to the TCP congestion window size which in turn 
is closely related to the system load. This can be, for 
instance, a measured loss rate or can also be a com- 
puted value such as the drop probability. Using the tech- 
niques, routers and switches can effectively control 
packets losses and TCP time-outs which maintaining 
high link utilization. 
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Description 
Field of Invention 

100011 The present invention resides in the field of congestion control in packet networks. In particular it is directed 
Yc buffer management techniques to be used at a node such as a router, switch, gateway, server, etc that enable a 
S^SS^^ilc^ and dynamically to changes in offered load or avaiiable bandwidth so that the 
resources are better utilized and fairly shared by users. 

Background of Invention 

[00021 Congestion control in packet networks has proven to be a difficult problem, in general. The problem _is par- 
iruhSv challenqinq in the Internet where congestion control is mainly provided by end-to-end mechan.sms in TCP by 
fX rnu« «3?o thefr Queues a large factor in TCP's widespread use is its ability to adapt quickly to changes in 

that concept described herein is equal.y applicab.e to other mechanisms in which a s.m.lar congest.cn control 

SoT Th" performance of TCP becomes significant* degraded when the number of active TCP flows .exceeds £ 
network's bandwidth-delay product measured in packets. When the TCP sender's congesUon w.ndow oecomes less 
man 4 o ackete TCP is no longer able to recover from a single packet loss since the f^ r*^ 
r t "att P dupncate acknowledgments (ACKs) to get triggered. Thus, the congestion windows b^w4 .not amenab.e 
fo the fast-retransmit mechanism of TCP and a single packet less wil. send the connection ^JJ** resulting 
00041 With inadequate buffering, a large number of connections will tend to keep the buffers full and the resu Umg 
Lacket losses Some many of the connections into time-out. As link utilization grows, premature loss may occur long 
Tefore is achieved due to the bursty nature of IP traffic. Pine grain bandwidth ^ J^JK 

shar no is achieved over time intervals under 1 or2 seconds, is important for interactive appl.cat.ons, but ,s not possible 
avoid time-outs. Cne way to soive this prob.em is to provision route* witf ^ ™ 
L of buffering, but buffenng proportional to the total number of active flows. Many router vendo «^JJ^ 
round-trip time" buffering approach. Although this is a step in the right direct.cn. th.s onty ^dresses the link i * zat.on 
problem but not the packet loss prcb.em. It is important to note that the requirement to support ^m>^ 
S-affic is of interest for any large deployment of IP. including existing or planned commerc.al IP serv.ces. I has ; been 
™Sad in "TCP Behavior wrth .Many Flows" by R. Morris. IEEE International Conference Network Protocols. Atlanta. 
STZTs^TiZZs^ ten packets per flow is desirable. Large buffers shou.d be possib.e ,n routers 
Sneeze cost 2 memory is drooping rapidly due to demands in the computer industry. However, to ensure s.able 

ooeration larce buffers require more active forms of management than the tradrt.onal ta.l-drop. 

pSSST-fiSSe idea behind actfve queue management schemes such as RED is to ^^^^^ 
early and to convey congestion notification to the endsystems. thus allowing them to reduce their transmission rates 
(Close tie flow control windows) before queues in the network overflow and excessive numbens o packed are dropped. 
An article entitled "Random Early Detection Gateways for Congest™ Avo.dance" by Flcyd et al, lEc^ACM Transac 
tiens on Networking. Vol. 1. No. 4, Aug. 1993, pp. 397-413 describes the RED scheme. 
0006] The basic RED scheme (and its newer variants) maintains an average of the queue lengthJt th 
average and a number of queue thresholds to detect congestion. RED schemes drop mcom.ng packets m a random 
pobabLTc manner where the probabi.ity is a function of recent buffer Rl. history. 

equitable distribution of packet loss, avoid the synchronization of flows, and at the same ttme improve the J unto zanon 
he networ1< The setting of the queue thresholds in RED schemes is problematic because the buffer size for good 
s ^pendent on ! the number of TCP connections using the buffer. To keep late « ^^^^ 
Lsirable to set the thresholds low. But setting it too low will cause many t.me-outs, wh.ch drast ^ de ^ e ^ 
latency perceived by the user. On the other hand, settng the thresho.ds too high unnecessary .ncreases -he latency 
when operating with a small number of connections. This means the setting of the thresholds shou:d not be done 
an ad hoc manner but should be tied to the number of active connections sharing the same buffer 
*0007] It is important to note that high network utilization is only good when the packet loss «T^ Jn^ZZ 
High packet less rates can negative* impact overall network and end-user P- rt ^~^^ > S^ 
network resources before it is dropped, thereby impacting the efficiency .n other parts < f the „«wo r £»»%Z£*^ 
high packet loss rates also cause long and unpredictable delays as a result of TCP t.me-outs. 1 1 is £ reror des a 
to achieve high network utilization but with low packet loss rates. This means that even * " 
network to achieve -lirh utilization acprooriate steps must also be taken to ensure that packet los.es are low 
0008 ^ZZn^T^Z^s of RED and other simi.ar schemes if :he threshold sett^gs were dynam- 
ically changed as the number of connections changes. The article "Scalabie TCP Congest.on Control by R. Moms. 
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Ph.D. Thesis, Harvard University, available as of Dec. 3, 1999 at the author's website http://www. odos.ics.mit.edu/ 
-rtm/papers/tp.pdf describes a technique to control packet losses by dynamically changing the queue threshold setting. 
The technique requires counting the number of connections by inspecting packet control headers. This technique, 
however, is impractical in real core networks. A short article on the same subject having the same title is also available 
at the website. 

[0009] It is therefore envisaged that it would be better to have other measures for adjusting the thresholds in order 
to keep the loss rate at or below a specified value that would not cause too many time-outs. This would ensure robust 
operation of the data sources while also keeping the queue length as small as possible. The control strategy of the 
invention does not involve flow accounting complexities as in the afore-mentioned article by R. Morris. 
[0010] In one embodiment, by estimating the actual loss rate and changing the buffer management parameters (e. 
g. thresholds) accordingly, it can be ensured that the loss rate will be controlled around a pre-specified target loss rate 
value. As the number of connections increases, the threshold will be adjusted upward to maintain a loss rate that will 
not cause excessive time-outs. As the number of connections decreases, the threshold will be adjusted downwards 
to keep the latency at the router as small as possible. This is to prevent excessive queue buildup when the number of 
flows is low. In further embodiments, the actual loss rate can be estimated by using a computed value, such as a drcp 
probability or a measured vaiue of packet ioss over time. 

[001 1] Current TCP implementations expect that the router will drop packets as an indication of congestion. There 
have been proposals for indicating congestion by marking the packet rather than dropping it. It is also possible to 
indicate congestion by generating a congestion notification message directly back to the sender, thus avoiding the 
round trip delay. Such implementations can reduce the total buffer required per flow but still benefit from adjusting the 
buffer management to ensure that enough, but not too much, buffer is made available for the number of flows. 

Summary of Invention 

[0012] The invention therefore resides in the field of buffer management schemes of a packet network. In accordance 
with one aspect, the invention is directed to a method of managing a buffer at an outgoing link of a node. The method 
comprises steps of monitoring the status of a queue in relation to a queue threshold and generating congestion noti- 
;:caric !t o :c ca:a sources \r* rzz^cnsz to the status of £.3 queue. The "sthcd iurther induces steps of ccmpiiting an 
indication concerning a system load of the node from the rate of congestion notifications and adjusting the queue 
threshold in response to the indication to keep the operation of the data sources within a preferred envelope. 
[0013] In accordance with a further aspect, the invention is directed to a mechanism for managing a buffer at an 
outgoing link of a node in a packet network. The mechanism comprises a queue for buffering packets to be transmitted 
from the node onto the outgoing link and a first controller for monitoring the status of the queue with respect to a first, 
queue threshold and generating congestion notifications to data sources in response the status of the queue. The 
mechanism further includes a parameter estimation block for generating an indication concerning a system load of the 
node and a second controller for adjusting the first queue threshold in response to the indication to keep the operation 
of the data sources within a preferred envelope. 

Brief Description of Drawings 

[0014] Figure 1 is a schematic block diagram of an implementation according to an embodiment of the invention. 

[0015] Figure 2 is a schematic illustration of a two-level ccntrol strategy. 

[0016] Figure 3 is a schematic illustration of limiting the impact of setpoint changes. 

[0017] Figure 4 is a block diagram of a ramp unit. 

[0018] Figure 5 is a graph showing a drop probability as a function of average queue length. 

[0019] Figure 6 is a graph showing a RED probability parameter. 

Detailed Description of Preferred Embodiments of Invention 

[0020] The concept of the invention is to adjust thresholds of a queue at a node in relation to the system load (i.e., 
the number connections or flows). The main concepts will be described in detail in connection with active queue man- 
agement schemes e.g., RED : DRED (Dynamic Random Eariy Detection), etc.. which use random packet drop mech- 
anism. They are, however, general enough to be applicable to other similar queue management schemes and schemes 
which use packet marking or direct backward congestion notification messages. DRED is an improved algorithm for 
active queue management and is capable of stabilizing a router queua at a level independent of the number of active 
connections. An applicant's copending application entitled "Method and Apparatus for Active Queue Management 
Based on Desired Queue Occupancy" (filing particulars not available) has inventors common to the inventors of the 
present application and describes the DRED scheme in detail. 
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ro0211 This adaptation of queue thresholds to the system .oad is important because the requirec I buffer size (and 
XT™! 8 SEE = ~ «. rat. ,s «"-~"«^ 

?0 a 022r e T?e n aehavior of TCP is reviewed here. When packet ioss rate is low, TCP can sustain its sendfcg rate by the 
a t fetra^iS-reoovery mechanism. The ^^^^ 

modeMor tte assize of the congestion window w of a TCP connection in the presence of ioss ,s. 
where „ is the number of packets that are acknowledgec .by a receded 

qestion most of those packets will be stored in the congest.cn buffer. Therefore, if the oss ; rate , » 10 

Sold a target level over a wide range of connections, then in order to '^^^fE^ deduced from 

adapt the buffering according to the system load (that .s ^^^^^J^^Z^ 1* 

the above equation that buffering that automat-cally adapts o the number of flow* suggested C0U nting 

outs, queue .ength and maintains high utilization, is desirable. Prev.ous ™* S "*™™™^J^ problematic 

the number of flows by inspecting packet headers but th,s has some J^^J^^^Vlo estimate 

when encryption is present. Since the window s*e of a flow ,s dosely Mtoidtt "M*^*^ divjd . 

=0=0^^ 

SST' Figure 1 shows a block diagram of the contro. technique according to an -bodiment of ^^Jjj^ 
system 10 can be viewed as having two loops. There is an inner loop 12. ^^^^^^J^^ 
d'op controller 16, and an outer loop 1 8, which adjusts the packet drop con roller sy P stem 0 f 

conditions. Process 14 is shown to include TCP sources 20 where each * ^JE^J£K notifications, 
TCP traffic. Each is considered to form pan of two loops because ,t ^spends to RED ^^SSS. parameters 
which affect the packet arrival rate. The block diagram shows a P^^S7^S5^.gTi*« drc ? 
(e.g.. an indication of the system load) of the process based on ocservat-ons or ^^^P compute9 

probability 24) and if required, outputs (e.g., queue sizes 26). There ,s also a "^'^^^ OU iput of the 
seme of me packet drop controi.er parameters based on the output of the P^^^T^^ roe process 
controller design block (i.e., computed parameters) is then used to P^^^^^^^,^ when 
parameters are computed (or estimated) continuously and the packet drop controller parameters are p 
new parameters values (e.g.. a new indication of the system load) are stained. ket drop 

( 00251 Referring further to Figure 1 , the technique dynamically change the queue « ° number 0 f 
controller 1 6 as the number of connections in the queue change. Tne packet crop con ller ^sts to he 
connections by inspecting a load Indication (e.g. , drop probabH.ty, measured loss '^"^J^S decreases the 
to keep the loss rate at a value that would not cause too many fme-outs. ^™^%ZZ%£^ — sive 
threshold will be adjusted downwards to keep the latency at the router as small « ^»*J^ Z ° adopted, where a 
, queue buildup when the number of flows s low. In this embod.ment. a two- evel control * rateg/ * adop • 
high-level controller (operating in the outer loop 1 8 on a slower time scale) sets ^^^^ -nd • taw 
controller (operat.ng in the inner loop 12 on a faster time scale) computes the pac *et d op ^«*aort^ suggested 
that mere is sufficient buffering B such that the queue threshold(s) can be vaned as needed. It nas sugg 
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ltratlgy t,meS " ""^ " ** °' 8XpeCted "° WS bS pr ° vided - Fi 9 ure 2 iilus,rates the «*o-«~e« control 



[0026] The high-level controller can be viewed as a "quasi-static" or "quasistationary" controller. That is when there 
s svstem IT" h S ^T 83 ° r penurbaticns <*» *■ 'or example, a change in the number of connec fan Te c .) S 
2*7 !, 10 Se0le mt0 3 nSW $teady State before the aueue thresholds) is changed This ensures ha he 

hreshold(s) ,s not changed unnecessary to affect the computation (and stability of packet drop probiil 2 whfch* 

to fS k. JOt aCtUa ' ' OSS rate ^ meaSUred by obsen " n 3 the packet arrival process to the queue, or can be esti- 

Te o DRE RED? .XT^T^ ° f PaCkSt COntr °" er ' 3Vai,ab,e in S ° me buffer management schemes 
(e g OREO RED). In these buffer management schemes, the computed packet drop probability is a qood measure 
of the ac^al loss rate to be used at 40 (Figure 2) since it approximates asymptotical." the loss ate ve^ weT^e 

« [0028] The queue threshold(s) can be varied dynamically to keep the packet loss rate Cose to a pre-specified target 
f°-hTne^ e B ™ X S,n ? ' ^ time "° UtS VSry deDendent on ;oss «- ^te that a target loss rate can only be attained 
t~rlZ k Pr ° P l y eng,neered and there are * de <V*e "^sources (•■9- buffers, capacity, etc.). Most random 
packet d op schemes have typically two queue thresholds, an upper threshold T and a lower threshold L In one em 

20 S° d l me " I n T ■ l, ° n ' "If UPPSr threSh0W rb S8leCted 35 the mani P u| ating variable (to achieve the desired control 
20 behawtor), wh,le the lower threshold Z. is tied to the upper threshold through a simple I near relationship e g VtT 

later. The control target T can then be varied dynamically to keep the packet loss rate close to the pre-specified value 



G max- 

[0029] In a further embodiment, the measured or computed packet loss rate p, is filtered (or smoothed) to remove 
rans.ent components before being used in the high-level controller. The smoothed signal is obtained 22 ^ EWMA 
(exponent.ally we.ghted moving average) filter with gain y (more weight given to the history): 9 

Pi «- 0-Y)P,+ TP/, 0 <y< 1 

lo h r e exS approximated by % . If no filtering is required, then % =p, . The target loss rate is selected, 

Kh Jr°i a,90rith ^ S f0f d V namical 'y ^justing the threshold T to achieve the desired control performance are 
described below according to embodiments of the invention. 

Algorithm 1: 

[0031] 

40 Basic Mechanism: 

lf 'P/ ' 1 > £ continuously for 5 sec, then 



Sampie Implementation: 
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begin: gee new p g value 
stare = time_now 
while \Pi-d nax \>£ 

stop = time^now 

if stop - start >= 5 sec 

T <r- [T + A7\sgn[p, -^ife 
break out of while loo? 

endif 

get new p t value 

endwhile 
go to begin 

/* time_now is a free running system clock / 



where 



e is a tolerance value to eliminate unnecessary updates of 

5 is an elapse time used to check the loss «^T^ * **« int ° « bandS (- *' *" * 

AT is the control step size and is given as AT=B,K, i.e., tne ou 

10. 12, etc.) 



35 



40 



sgr. [.! denotes the sign of [.] 3 „ nw<s d to be close to 8 resulting in drop-tail behavior 

»"S r ,„oan be set to ooe Bandwidth-delay product worth of data. 

Algorithm 2: 
[0032] 

n ™eTe mre^otd r ,s on,, eha„ 9 ed «heo the tosa rate ft is etthe, abe». or date* the ta-ge, teas rate .„ 
and the loss rate is deviating from the target loss rate. 



Sample Implementation: 
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5 



every 5 seconds do the following: 
if p l (now) - 0 mas > 0.0 then 

if p, (now) - p t (previous) > 0.0 

increase T: T <-[t + &t]t~ 



10 



else do not change T 
else if p,(now)-0 nuxx <0.0 then 



if Oom') - pi (previous) < 0.0 



decrease 7; r^[r-Ar£~ 



else do not change T 



In the above procedure there is no notion of a loss tolerance e. Once the less rate goes above/celow the target loss 
rate, indicating that there is a drift from the target, the queue threshold T is then increased/decreased. Otherwise the 
threshold is "frozen^ since further changes can cause the loss rate to deviate further from the target loss rate. The 
deviations p^now) - p, (previous) in the above procedure can be computed over shorter sampling intervals (e.g., periods 
smaller than 5 seconds) or over longer intervals, depending on the measurement overhead allowed. 
[0033] In most buffer management schemes, the control loops have constant thresholds (or setpoints). But as dis- 
cussed above, the thresholds may change at certain time instances because of desires to change operating conditions 
such as user delays, loss rates, etc. A threshold is. as a result, typically piece-wise constant with changes occurring 
less frequently. It is therefore suitable to view the threshold as a step function. Since the threshold is a system distur- 
bance that can be accessed to, it is possible to feed it through a low-pass filter or a ramping module before it enters 
the low-level controller. In this way, the step function can be made smoother. This property can be useful, since most 
control designs having a good rejection of load disturbances give large overshoots after a sudden change in the thresh- 
old. Smoothing of the queue threshold is particularly important when the step change & Tis large. This way the command 
signals from the high-level controller can be limited so that setpoint changes are not generated at a faster rate than 
the system can cope. 

[0034] Figure 3 shows a scheme for limiting the impact of setpoint changes. In the Figure, a low pass filter or ramping 
unit 50 is located between low-level controller 52 and high-level controller 54. As in the earlier figures, two controllers 
take in same inputs and generate similar outputs except that the queue threshold(s) from the high-level controller is 
passed through filter 50 to smooth out the signal. 

[0035] Figure 4 is a block diagram of a ramp unit or rate limiter that can replace the low-pass filter. The output of the 
ramp unit will attempt to follow the input signals. Since there is an integral action in the ramp unit, the inputs and the 
outputs will be identical in steady state. Since the output is generated by an integrator with limited Input signal, the rate 
of change of the output will be limited to the bounds given by the limiter. Figure 4 can be described by the following 



equations: 



eft ~ 



sat(e) = sat(7-y) 



45 



in continuous-time domain 
and 



&y(n) = y(n) - y(n - 1) = sat (7 - y(n - 1)) 



50 



► in discrete-time domain. 



y(n) = y(n - 1) + sat(7 - y(n - 1)) 



The amplitude limiter or saturation "sat(e) :l is defined as 



55 
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-a, e < -a 


sat(e) = < 


e, \e\ < a 




a, eta 
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4Tlfl o_nv\r 0<a<1, if AT is large enough to 
„he,e the IM. a can be defined as a small tn««c*ot « «J yls L OT „o,hed thresnold input 

initialize y <- 0 
while e = 7-^*0 
_y 4- ^ + sat(e) 

pass yto the low-level controller 
wait for next y computing ciir.e 
endwhiie 

An embodiment using the D RED algorithm 

,0036] Asmen ti one d eanie,theO y namic^ 

uses a simple control-theoretic approach to st » S ^sources utilization, predictable delays. 

?0 0 C 3°; n ^ O e n act U a. queue size in the router is assumed to be ^^TX^T^Z. ftSSS 
Se picket drop controller provides a new valueoT *e packet time n, where n=1 

fh'affhe « terror signal ^, - ffi - help ma intain high Hnk utilization 

[00381 A lower queue threshold paramete L .s '^oduced hn ^ e c ° m ™ p Me lower than r, e.g.. L-bT.b 

and keep the queue size around the target level. The parameter LbJPJ Resource utilization and also not to 
1 ro B 0 91 OREO does not drop packets when q(n) < L in order to mamta n high ^resotree , ^ 
SSI penalize purees which are in the process of baking ^^^^St Vpacket drop. The 
is always a time iag between the time a packet . *°PP*«d » ^ pjnded (when. q(n) < L). 



DRED control parameters: 



45 



50 



55 



Control gain a: Filter gain p; Target buffer occupancy T(n); 
Minimum queue threshold L(n) = bT(n), b £ [0.8: 0.3] 



At time n: 



Sample queue size: q(n) 
Compute current error signal: e(n) = q(n) - T(n) 
Compute filtered error signal: e(n) = fl - p)*n - .) ♦ ^(n) 
Compute current packet drop probability: 



8 
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A/0) = 



/>./(»- 1) + a 



27*(n) 



/wax Si 



/0 



The 27^/i) term in the above computation is simply a normalization parameter so that the chosen control gain 
a can be preserved throughout the control process since 7" can vary. The normalization constant in the ORED 
algorithm is generally the buffer size B. 

Use pjn) as the packet drop probability until time n+1 , when a new p d is to be computed again 

Store e(n) and pjn) to be used at time n+1 . 



[0040] In DRED, a good measure of the actual packet loss rate is the packet drop probability pjn). pjn) converges 
asymptotically to the actual loss rate. Thus, pjn) approximates the actual loss rate very well. Therefore, Pt(n)=p d (n) 
is used in this embodiment as an indicator for varying the control target Tin order to reduce losses. The p f (n)=p d (n) 
?5 values are filtered as described earlier to obtain p, (n) which is then used in the high-level controller. The filtered values 
are computed as follows: 



Pi(n) <- (1 - Y)p, (n - 1) + ypjn), 0 < y< 1 . 

20 

An embodiment using the RED algorithm 

[0041] The RED maintains a weighted average of the queue length which it uses to detect congestion. When the 
average queue length exceeds a minimum threshold L, packets are randomly dropped with a given probability. The 
probability that a packet arriving at the RED queue is dropped depends on, among other things, the average queue 
length, the time elapsed since the last packet was dropped, and a maximum packet drop probability parameter maXp. 
When the average queue length exceeds a maximum threshold T, all arriving packets are dropped. 
[0242] Tha basic RED algorithm Is as shown in the following psaudo-code: RED control parameters: 
Filter gain W qi minimum queue threshold L; maximum queue threshold T; maximum packet drop probability max p . 

for each packet arrival 

calculate the new average queue size a\>g 
if L < avg < T 

calculate probability p a 
with probability p a : 

mark/drop the arriving packet 
else if T<avg 

crop the arriving packet 



where a^gis the average queue size, p a is the RED final packet drop probability. L and Tare the threshold parameters 

that control the average queue length and are defined by the network manager. 

[0043] The average queue size, avg, is maintained by the RED gateway using an EWMA filter 



50 avg^(1-w q ).avg + w q -q 

where w q is the filter gain, and q is the current queue size. The RED final drop probability p a is derived such that 

avg ■ L 
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Loss Behavior of the RED Algorithm 
follows: 

v n <r y < 1 which is defined as the normalized buffer fill. 
That is, p b is a linear function of the quantify X 0 S **J^^™ interdrop ping time r between two packets 
[0 0461 It is shown in the afore-ment.oned art.cle by Floyd et al »e mtt c . PP 9 proba bility Pb , 

follows a geometric distribution with parameter p b and mean £[t] = 1/p„ if each | pacKei « PP ket 
that is, ProW - m - d - P^-'P. The interdropping time , .s the number c tgfij^^lJS. d > pped 
unti. the next packet is dropped. For example, whan. max, 0.1 . 1 ou of every 10 p £ ^ 

when the average queue size is close to the max.mum threshold T (Ke X-^l )■ 9 ^ ^ 

,oss rate of 10%. Although 1/p„ packets are dropped on ^ bTSS 2 a. referenced above 

coal of drooping packets at fairly regular intervals over short perods The arte* oy Moy 
therefore proposes the computation of the RED final packetdrop prcbab.lrty as follows. 

Pb _ _J 

P '*~ 1-count.p b ±. count 
Pb 

The P ,ot of RED f.na. packet drop probability p a is therefore an °_ f 

the case of the temporary packet-marking probability p b or p d m DRE D. probability 
1 mean parking — 

maximum threshold (i.e.. X - 0.5), a .ess rate of 1 0 % ■ ' ^ aThie^d Nol assume that the m.nimum 

than 10% is achieved, while when X > 0.5, a loss rate h.gher than 10 .. s ach .eved. N 
threshold is set to zero, without loss of generality. The probability Pb , wn.ch w.H g^e a packet less rat- * 
this examp.c ('or a max p = 0.1 ), is given by: 

ava 1 ( 1 ) 

Pb = max p ■ -f = max p ■ X = max p . = § 

This results in : he average intermarking time between two packets (when P acke:s are marked with probability pj being 
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E[x] = J-=_L- packets. 
2p b max p 

s This generates an average packet loss rate of 

10 From this example, this corresponds to marking 1 cut of every 10 packets, achieving a loss rate of 10%. as expected. 

Therefore, as long as the relationship X= 0.5 in equation (1) above holds, a 10% packet loss rate will be enforced. 

However, it is impossible for this relationship to be maintained over time, since the average queue size changes over 

time (as a function of the system load), and the maximum threshold 7"is a static parameter in the basic RED technique. 

As a solution, one possibility is to scale the maximum threshold Tin relation to the average queue size avg. That is, 
is to maintain a max p = 0.1 in the above example (corresponding to a loss rate <j> = 10%), Tcan be varied accordingly in 

avg I Tto obtain the ratio of *h. 

[0049] Following are some observations regarding the control of packet loss using the RED algorithm: 

(1 ) Assuming that the target loss rate e max is 5% and max p is set to 5%. Then on average I out of every 20 packets 
20 must be marked/dropped. For this to be achieved we require the average intermarking time between two packets 

to be: 



EM = 5 — = 20 packets. 
25 * P *> 

This gives probability p b equal to 2.5%. Thus, to achieve the target loss rate, the probability p b needs to be main- 
:Sir;«ci jorsiant e ver tune. 7c do zz, tr^ rario cetwear. :r.o average queue size avg and maximum threshold T.nusi 
be equal to y k, as indicated by (1) for a max p set to 5%. If max p is set to a higher value (for the same target loss 
30 rate), then the ratio between the average queue size and maximum threshold must be smaller, whereas if max p 

is set to a lower value, the ratio must be larger. This means that regardless of the setting of max pt the ratio avgIT 
can be varied to obtain a constant p^ That is, to achieve a target loss rate for any setting of max pt we want to 
maintain 

35 Qyn 

p b = max p •— ^ = constant 

such that the target loss rate Q max = 1/ = 2p b is obtained. This suggests that max p is not a critical factor but 
rather maintaining a constant P b that results in the target loss rate of 9 max = 1/ £M = 2p b is more important. 
40 (2) The RED final packet drop probability p a is not a good representation of the loss rate as explained above, since 

it changes each packet arrival. However, filtering it (i.e., filtering all those exponential curves) gives us an estimate 
of the loss rate. The RED final packet drop probability p a is therefore filtered by using an EWMA filter with a gain 
cf 7 = 0.00005. That is, 

43 A A 

P/ = (1 -Y)P/ + YP a . 



[0050] This filtered quantity (i.e. , output of the parameter estimator) is input to the controller design block of the high- 
level controller. This value is used to adapt the maximum threshold T according to the algorithms described earlier. It 

so can be seen that when the traffic load increases/decreases, the average queue size increases/decreases, resulting in 
cranges of the quantity X (which is the ratio between the average queue size avg and maximum threshold 7). Ccnse- 
cuently. the probabilities p b and p a change, causing a deviation from the target loss rate. The algorithms described 
earlier are used to detect such deviations so that the threshold Tcan be changed accordingly. Basically, as illustrated 
in the previous example, if X increases beyond 'A due to a load increase, the filtered quantity will increase beyond the 

55 target loss rate of 5%. The maximum threshold Tis thus increased making X 10 converge back to 16, resulting in the 
loss rate converging back ro 5%. The same applies when there is a decrease in the system load. 
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Claims 

1 . in a packet nelwotk. a matnod of managing a buffer at an outgoing link of a noda oompnaing stapa of: 

b^oat^^ 
preferred envelope. 

2 . The method of managing a buffer at an outgoing link of a node according to ciaim 1 . wherein further comprising 
steps of: 

determine a deviation of the indication from a target value; and . . t , east a pre deter- 

adjusting the queue threshold in ,f the deviation is larger than a predeterm.ned value for at least pre 

mined duration of time. 

The method of managing a buffer at an outgoing link of a node according to claim 2. comprising a further step of: 
^nJSTLi^Uon notifications over time to derive a congestion notification rate parameter. 

The method of managing a buffer at an outgoing link of a node * ^2^25^ 

monitoring the congestion notifications at a predetermined sampling interval to denve a cong 

tion rate parameter 

The method el managing a buffer at an outgoing link of a node according to Cairn 2. comprising further steps of: 

performing a random early detection buffer management process; 
observing variables of the process; and 

computing from the variables the indication concerning the system load of the node. 
The method of managing a buffer at an outgoing .ink of a node according to claim 3, further comprising steps of: 



3. 
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monitoring a current queue size; threshold* and 

^rg.-^^^^ 

error signal. 

■■-^:=oo^ 

ability by using an exponentially weighted moving average filter with a predetermined gain. 

45 3 ^adjusting one of the two queue thresholds in proportion to the deviation. 

9 The method of managing a buffer at an outgoing link of a node according to claim 3. wherein the step of acting 
the queue threshold is performed through the use of smoothing block. 
so 10. The method of managing a buffer at an outgoing link of a node according to claim 4, further comprising steps of: 

mcnitorinq a current queue size; and 

computing an error Sl gna. in response to the current queue size and the ^^^ eter using the 

computing a current congestion notation probability as the congestion noff.cat.on rate paramet. 

55 error signal. 

1 1 The method of managing a buffer at an outgoing link of a node according to claim 1 0, ^ZanZTonlo^n 
filtering the current packet congestion notification probability to generate a smoothed congest.cn noOficat. 
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probability by using an exponentially weighted moving average filter with a predetermined gain. 

12. The method of managing a buffer at an outgoing link of a node according to claim 11 , wherein there are two queue 
thresholds and one being set to the other with a predetermined linear relationship, the method further comprising 
a step of: 

adjusting one of the two queue thresholds in proportion to the deviation. 

13. The method of managing a buffer at an outgoing link of a node according to claim 12, wherein the step of adjusting 
the queue threshold is performed through the use of smoothing block. 

14. The method of managing a buffer at an outgoing link of a node according to claim 5, further comprising steps of: 



monitoring a current queue size; 

computing an error signal in response to the current queue size and the queue threshold; and 
,5 computing a current congestion notification probability as the congestion notification rate parameter using the 

error signal; and 

using the congestion notification rate parameter as the variable from which the indication concerning the sys- 
tem load of the node is computed. 

20 is. The method of managing a buffer at an outgoing link of a node according to claim 1 4, further comprising a step of: 
filtering the current congestion notification probability to generate a smoothed congestion notification prob- 
ability by using an exponentially weighted moving average filter with a predetermined gain. 

1 6. The method of managing a buffer at an outgoing link of a node according to claim 1 5, wherein there are two queue 
25 thresholds and one being set to the other with a predetermined linear relationship, the method further comprising 

a step of: 

adjusting one of the two queue thresholds in proportion to the deviation. 

17. The method of managing a buffer at an outgoing link of a node according to claim 16, wherein the step of adjusting 
30 the queue threshold is performed through the use of smoothing block. 

18. The method of managing a buffer at an outgoing link of a node according to claim 3, further comprising steps of: 

monitoring an average queue size over time; 
35 counting the number of forwarded packets; and 

computing a current congestion notification probability as congestion notification rate parameter using the 
count and the average queue size in relation to the queue threshold. 



19. The method of managing a buffer at an outgoing link of a node according to claim 1 8, further comprising a step cf: 
filtering the current packet congestion notification probability' to generate a smoothed packet congestion 
notification probability by using an exponentially weighted moving average filter with a predetermined gain. 



20. The method of managing a buffer at an outgoing link of a node according to claim 1 9, wherein there are two queue 
thresholds and one being set to the other with a predetermined linear relationship, the method further comprising 

45 a step of: 

adjusting one of the two queue thresholds in proportion to the deviation. 

21 . The method of managing a buffer at an outgoing link of a node according to claim 20, wherein the step of adjusting 
the queue threshold is performed through the use of smoothing block. 



22. The method of managing a buffer at an outgoing link of a node according to claim 4, further comprising steps cf: 

moniroring an average queue size over time; 
counting the number of forwarded packets; and 

computing a current congestion notification probability as congestion notification rate parameter using the 
count and the average queue size in relation to the queue threshold. 

23. The method of managing a buffer at an outgoing link of a ncde according to claim 22, further comprising a step cf : 
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3 ^adjusting one of the two queue thresholds in proportion to the deviation. 

, 0 the queue threshold is performed through the use of smooth.ng block. 

«. The method of managing a buffer at an outgoing link of a node according to claim 5. further comprising steps of: 
monitoring an average queue size overtime; 

tem load of the node is computed. 

3 ^adjusting one of the two queue thresholds in proportion to the deviation. 
29 The method of managing a buffer at an outgoing link of a node according to claim 28. wherein the step of adjusting 

' the queue threshold is performed through the use of smooth.ng block. 
30. A mechanism for managing a buffer at an outgoing link of a node in a packet network comprising: 

congestion notifications to data sources in response the status of the queue; 
40 of the data sources within a preferred envelope. 

tions in accordance with the queue status. 
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32. The mechanism for managing a buffer at an outgoing link of a node in , a jacket network according to claim 31 . 
wherein the first controller operates at a faster speed than the second ccntro.ler. 

33. The mechanism for managing a buffer at an outgoing link of a node ^^^^^^^^ 
whereintheparameterestimationblock^^ 

notification probability, and an exponentially weighted mov.ng average filter «th a predeterm g 
transient component of the congestion notification probability 

34. The mechanism for managing a buffer at an outgoing link of a node ^^^^^^o 
wherein the second controller further comprises a smooth.ng block for smoothing .he acjustme 



threshold 



35. The mechanism for managing a buffer at an outgoing link of a node in a packet network according to claim 32. 
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wherein the smoothing block is either a low pass filter or a ramp unit. 

36. The mechanism for managing a buffer at an outgoing link of a node in a packet network according to claim 30 
threshoW mPnS,n9 3 SeC ° nd qU6Ue threSh °' d 3 predetermined linear relauonship with the first queue 
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